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(57) Abstract 

There is disclosed a cancer diagnostic method based upon DNA methylation 
differences at specific CpG sites. As set forth in the Figure, the method comprises 
bisulfite treatment of DNA, followed by methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE) for determination of strand-specific methylation status at cytosine 
residues. 
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A CANCER DIAGNOSTIC METHOD BASED UPON DNA METHYLATION 

DIFFERENCES 



Technical Field of the Invention 

The present invention provides a cancer diagnostic method based upon DNA 
methylation differences at specific CpG sites. Specifically, the inventive method provides for a 
bisulfite treatment of DNA, followed by methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE), for determination of strand-specific methylation status at cytosine 
residues. 

Background of the Invention 

Cancer treatments, in general, have a higher rate of success if the cancer is diagnosed 
early and treatment is started earlier in the disease process. The relationship between improved 
prognosis and stage of disease at diagnosis hold across all forms of cancer for the most part. 
Therefore, there is an important need to develop early assays of general tumorigenesis through 
marker assays that measure general tumorigenesis without regard to the tissue source or cell 
type that is the source of a primary tumor. Moreover, there is a need to address distinct genetic 
alteration patterns that can serve as a platform associated with general tumorigenesis for early 
detection and prognostic monitoring of many forms of cancer. 
Importance of DNA Methylation 

DNA methylation is a mechanism for changing the base sequence of DNA without 
altering its coding function. DNA methylation is a heritable, reversible and epigenetic change. 
Yet, DNA methylation has the potential to alter gene expression, which has profound 
developmental and genetic consequences. The methylation reaction involves flipping a target 
cytosine out of an intact double helix to -allow the transfer of a methyl group from S- 
adenosylmethionine in a cleft of the enzyme DNA (cystosine-5)-methyltransferase 
(Klimasauskas et aL, Cell 76:357-369, 1994) to form 5-methylcytosine (5-mCyt). This 
enzymatic conversion is the only epigenetic modification of DNA known to exist in vertebrates 
and is essential for normal embryonic development (Bird, Cell 70:5-8, 1992; Laird and 
Jaenisch, Human Mol Genet. 3:1487-1495, 1994; and Bestor and Jaenisch, Cell 69:915-926, 
1992). The presence of 5-mCyt at CpG dinucleotides has resulted in a 5-fold depletion of this 
sequence in the genome during vertebrate evolution, presumably due to spontaneous 
deamination of 5-mCyt to T (Schoreret et aL, Proc. Natl Acad. Sci. USA 89:957-961, 1992). 
Those areas of the genome that do not show such suppression are referred to as "CpG islands" 
(Bird, Nature 321:209-213, 1986; and Gardiner-Garden et aL, J. Mol Biol 196:261-282, 
1987). These CpG island regions comprise about 1% of vertebrate genomes and also account 
for about 1 5% of the total number of CpG dinucleotides (Bird, Infra.). CpG islands are 
typically between 0.2 to about 1 kb in length and are located upstream of many housekeeping 
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and tissue-specific genes, but may also extend into gene coding regions. Therefore, it is the 
methylation of cytosine residues within CpG islands in somatic tissues, which is believed to 
affect gene function by altering transcription (Cedar, Cell 53:3-4, 1988). 

Methylation of cytosine residues contained within CpG islands of certain genes has 
5 been inversely correlated with gene activity. This could lead to decreased gene expression by a 
variety of mechanisms including, for example, disruption of local chromatin structure, 
inhibition of transcription factor-DNA binding, or by recruitment of proteins which interact 
specifically with methylated sequences indirectly preventing transcription factor binding. In 
other words, there are several theories as to how methylation affects mRNA transcription and 
10 gene expression, but the exact mechanism of action is not well understood. Some studies have 
demonstrated an inverse correlation between methylation of CpG islands and gene expression, 
however, most CpG islands on autosomal genes remain unmethylated in the germline and 
methylation of these islands is usually independent of gene expression. Tissue-specific genes 
are usually unmethylated and the receptive target* organs but are methylated in the germline and 
1 5 in non-expressing adult tissues. CpG islands of constitutively-expressed housekeeping genes 
are normally unmethylated in the germline and in somatic tissues. 

Abnormal methylation of CpG islands associated with tumor suppressor genes may also 
cause decreased gene expression. Increased methylation of such regions may lead to 
progressive reduction of normal gene expression resulting in the selection of a population of 
20 cells having a selective growth advantage {i.e., a malignancy). 

It is considered that altered DNA methylation patterns, particularly methylation of 
cytosine residues, cause genome instability and are mutagenic. This, presumably, has led to an 
80% suppression of a CpG methyl acceptor site in eukaryotic organisms, which methylate their 
genomes. Cytosine methylation further contributes to generation of polymorphism and germ- 
25 line mutations and to transition mutations that inactivate tumor-suppressor genes (Jones, 

Cancer Res. 56:2463-2467, 1996). Methylation is also required for embryonic development of 
mammals (Bestor and Jaenisch, Cell 69:915-926, 1992). It appears that that the methylation of 
CpG-rich promoter regions may be blocking transcriptional activity. Therefore, there is a 
probability that alterations of methylation are an important epigenetic criteria and can play a 
30 role in carcinogenesis in general due to its function of regulating gene expression. Ushijima et 
al. (Proc. Natl. Acad. ScL USA 94:2284-2289, 1997) characterized and cloned DNA fragments 
that show methylation changes during murine hepatocarcinogenesis. Data from a group of 
studies of altered methylation sites in cancer cells show that it is not simply the overall levels 
of DNA methylation that are altered in cancer, but changes in the distribution of methyl 
35 groups. 

These studies suggest that methylation, at CpG-rich sequences known as CpG islands, 
provide an alternative pathway for the inactivation of tumor suppressors, despite the fact that 
the supporting studies have analyzed only a few restriction enzyme sites without much 
knowledge as to their relevance to gene control. These reports suggest that methylation of 
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CpG oligonucleotides in the promoters of tumor suppressor genes can lead to their inactivation. 
Other studies provide data that suggest that alterations in the normal methylation process are 
associated with genomic instability (Lengauer et al. Proc. Natl Acad. Sci. USA 94:2545-2550, 
1997). Such abnormal epigenetic changes may be found in many types of cancer and can, 

5 therefore, serve as potential markets for oncogenic transformation, provided that there is a 
reliable means for rapidly determining such epigenetic changes. The present invention was 
made to provide such a universal means for determining abnormal epigenetic changes and 
address this need in the art. 
Methods to Determine DNA Methylation 

1 0 There is a variety of genome scanning methods that have been used to identify altered 

methylation sites in cancer cells. For example, one method involves restriction landmark 
genomic scanning (Kawai et al., Mol Cell. Biol 14:7421-7427, 1994), and another example 
involves methylation-sensitive arbitrarily primed PCR (Gonzalgo et al., Cancer Res. 57:594- 
599, 1997). Changes in methylation patterns at specific CpG sites have been monitored by 

15 digestion of genomic DNA with methylation-sensitive restriction enzymes followed by 
Southern analysis of the regions of interest (digestion-Southern method). The digestion- 
Southern method is a straightforward method but it has inherent disadvantages in that it 
requires a large amount of DNA (at least or greater than 5 p,g) and has a limited scope for 
analysis of CpG sites (as determined by the presence of recognition sites for methylation- 

20 sensitive restriction enzymes). Another method for analyzing changes in methylation patterns 
involves a PCR-based process that involves digestion of genomic DNA with methylation- 
sensitive restriction enzymes prior to PCR amplification (Singer-Sam et al., Nucl Acids Res. 
1 8:687,1990). However, this method has not been shown effective because of a high degree of 
false positive signals (methylation present) due to inefficient enzyme digestion of 

25 overamplification in a subsequent PCR reaction. 

Genomic sequencing has been simplified for analysis of DNA methylation patterns and 
5-methylcytosine distribution by using bisulfite treatment (Frommer et aL, Proc. Natl Acad. 
Sci. C/S/i 89:1827-1831, 1992). Bisulfite treatment of DNA distinguishes methylated from 
unmethylated cytosines, but original bisulfite genomic sequencing requires large-scale 

30 sequencing of multiple plasmid clones to determine overall methylation patterns, which 

prevents this technique from being commercially useful for determining methylation patterns 
in any type of a routine diagnostic assay. 

In addition, other techniques have been reported which utilize bisulfite treatment of 
DNA as a starting point for methylation analysis. These include methylation-specific PCR 

35 (MSP) (Herman et al. Proc. Natl. Acad Sci. USA 93:9821-9826, 1992); and restriction enzyme 
digestion of PCR products amplified from bisulfite-converted DNA (Sadri and Hornsby, Nucl 
Acids Res. 24:5058-5059, 1996; and Xiong and Laird, Nucl Acids Res. 25:2532-2534, 1997). 

PCR techniques have been developed for detection of gene mutations (Kuppuswamy et 
al., Proc. Natl Acad. Sci. USA 88:1 143-1 147, 1991) and quantitation of allelic-specific 
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expression (Szabo and Mann, Genes Dev. 9:3097-3108, 1995; and Singer-Sam et ah, PCR 
Methods AppL 1 :1 60-163, 1992). Such techniques use internal primers, which anneal to a 
PCR-generated template and terminate immediately 5' of the single nucleotide to be assayed. 
However an allelic-specific expression technique has not been tried within the context of 
5 assaying for DNA methylation patterns. 

Therefore, there is a need in the art to develop improved diagnostic assays for early 
detection of cancer using reliable and reproducible methods for determining DNA methylation 
patterns that can be performed using familiar procedures suitable for widespread use. This 
invention was made to address the foregoing need. 

10 

Summary of the Invention 

The present invention provides a method for determining DNA methylation patterns at 
cytosine sites, comprising the steps of: 

(a) obtaining genomic DNA from a DNA sample to be assayed; 
1 5 (b) reacting the genomic DNA with sodium bisulfite to convert unmethylated 

cytosine residues to uracil residues while leaving any 5-methylcytosine residues unchanged to 
provide primers specific for the bisulfite-converted genomic sample for top strand or bottom 
strand methylation analysis; 

(c) performing a PCR amplification procedure using the top strand or bottom strand 
20 specific primers; 

(d) isolating the PCR amplification products; 

(e) performing a primer extension reaction using Ms-SNuPE primers, [ 32 P]dNTPs 
and Tag polymerase, wherein the Ms-SNuPE primers comprise from about a 1 5 mer to about a 
22 mer length primer that terminates immediately 5' of a single nucleotide to be assayed; and 

25 (f) determining the relative amount of methylation at CpG sites by measuring the 

incorporation of different 32 P-labeled dNTPs. 

Preferably, the [ 32 P]NTP for top strand analysis is [ 32 P]dCTP or [ 32 P]TTP. Preferably, 
the [ 32 P]NTP for bottom strand analysis is [ 32 P]dATP or [ 32 P]dGTP. Preferably, the isolation 
step of the PCR products uses an electrophoresis technique. Most preferably, the 

30 electrophoresis technique uses an agarose gel. Preferably, the Ms-SNuPE primer sequence 
comprises a sequence of at least fifteen but no more than twenty five, bases having a sequence 
selected from the group consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 
[SEQ ID NO. 3], HuNl [SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], 
HuN4 [SEQ ID NO. 8], HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], CaSl [SEQ ID NO. 

35 10], CaS2 [SEQ ID NO. 1 1], CaS4 [SEQ ID NO. 12], and combinations thereof. 

The present invention further provides a Ms-SNuPE primer sequence designed to 
anneal to and terminate immediately 5' of a desired cytosine codon in the CpG target site and 
that is located 5' upstream from a CpG island and are frequently hypermethylated in promoter 
regions of somatic genes in malignant tissue. Preferably, the Ms-SNuPE primer sequence 
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comprises a sequence of at least fifteen bases having a sequence selected from the group 
consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], HuNl 
[SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 8], 
HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID NO. 

5 11], CaS4 [SEQ ID NO. 12], and combinations thereof. The present invention further provides 
a method for obtaining a Ms-SNuPE primer sequence, comprising finding a hypermethylated 
CpG island in a somatic gene from a malignant tissue or cell culture, determining the sequence 
located immediately 5' upstream from the hypermethylated CpG island, and isolating a 15 to 
25 mer sequence 5' upstream from the hypermethylated CpG island for use as a Ms-SNuPE 

1 0 primer. The present invention further provides a Ms-SNuPE primer comprising a 1 5 to 25 mer 
oligonucleotide sequence obtained by the process comprising, finding a hypermethylated CpG 
island in a somatic gene from a malignant tissue or cell culture, determining the sequence 
located immediately 5' upstream from the hypermethylated CpG island, and isolating a 1 5 to 
25 mer sequence 5' upstream from the hypermethylated CpG island for use as a Ms-SNuPE 

1 5 primer. 

Brief Description of the Drawings 

Figure 1 shows a diagram of the inventive Ms-SNuPE assay for determination of 
strand-specific methylation status at cytosines. The process involves treating genomic DNA 

20 with sodium bisulfite, and generating a template by a PCR technique for a top strand 
methylation analysis. Alternatively a bottom strand methylation can also be assayed by 
designing the appropriate primers to generate a bottom strand-specific template. The process 
further entails amplifying the templates by a PCR technique. The PCR products are 
electrophoresed and isolated from agarose gels, followed by incubation with Ms-SNuPE 

25 primers, as disclosed herein wherein the Ms-SNuPE primers comprise a from about a 15 mer to 
about a 25 mer length primer that terminates immediately 5' of a single nucleotide to be 
assayed, and PCR buffer, [ 32 P]dNTPs and Taq polymerase for primer extension reactions. The 
radiolabeled products are separated, for example, by electrophoresis on polyacrylamide gels 
under denaturing conditions and visualized by exposure to autoradiographic film or 

30 phosphorimage quantitation. 

Figure 2 shows the results from a quantitative methylation analysis of three top strand 
CpG sites from a 5' CpG island of pi 6. PI 6 is a known tumor suppressor gene and the 
particular region examined for changes in methylation is the promoter region of this gene. The 
top panel provides the locations of three sites analyzed (numbered 1 , 2 and 3) relative to the 

35 putative transcriptional start sites (vertical arrows pointing upwards) and the exon la coding 
domain. The PCR primers used for top strand amplification of the 5' region of pi 6 (which 
includes putative transcriptional start sites) were 5'-GTA GGT GGG GAG GAG TTT AGT T- 
3* [SEQ ID NO. 13] and 5'-TCT AAT AAC CAA CCA ACC CCT CC-3' [SEQ ID NO. 14]. 
The control sets included "M" PCR product amplified from a plasmid containing bisulfide- 
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specific methylated sequence; M U" PCR product amplified from a plasmid containing bisulfite- 
specific unmethylated sequence; and "mix" a 50:50 mixture of methylated and unmethylated 
PCR-amplified plasmid sequences. The DNA samples analyzed included T24 and J82 bladder 
cancer cell lines; wbc (white blood cell), melanoma (primary melanoma tumor tissue sample), 
5 and bladder (primary bladder tumor tissue sample). The tissue samples were micro dissected 
from paraffin-embedded tumor material. The grid at the bottom of the lower panel shows the 
ratio of methylated (C) versus unmethylated (T) bands at each site based upon phosphorimage 
quantitation. 

Figure 3 shows a mixing experiment showing a linear response of the inventive Ms- 

10 SNuPE assay for detection of cytosine methylation. A T24 bladder cancer cell line DNA 

(predominantly methylated) was added in increasing amounts to a J82 bladder cancer cell line 
DNA (predominantly unmethylated). Figure 3 shows data from an 18 mer oligonucleotide 
[SEQ ID NO. 16] which was used in multiplex analysis of CpG methylation (site 2) of the pi 6 
5'CpG in combination with a 1 5-mer and 21 -mer primer [SEQ ID NOS 17 and 15, 

1 5 respectively] (correlation coefficient =0.99). Both the 1 5 mer and 21^mer produced a nearly 
identical linear response as the 1 8-mer. Figure 3 shows data from three separate experiments. 

Figure 4 shows a schematic diagram that outlines a process for a high-throughput 
methylation analysis. The Ms-SNuPE primer extension reactions are performed and then the 
products are directly transferred to membranes, preferably nylon membranes. This allows for a 

20 large number of samples to be analyzed simultaneously in a high-density format. The 

membrane is washed and exposed to a phosphorimage cassette for quantitative methylation 
analysis and eliminate the need for polyacrylamide gel electrophoresis for data measurement. 

Figure 5 (Panel A) shows results from quantitative analysis of DNA methylation using 
the Ms-SNuPE blot transfer technique of Figure 4. Levels of DNA methylation in matched 

25 normal and tumor colon specimens were analyzed in the 5 1 promoter region of the pi 6 gene. 
The average methylation of 3 sites in the pi 6 promoter (Figure 2) was determined by 
quantitating the C:T signal ration by phosphorimage analysis. Panel B shows the results of 
quantitating the average methylation of 3 CpG sites using standard polyacrylamide gel 
electrophoresis compared to dot blot transfers. The average methylation of the monitored sites 

30 in various colon specimens is plotted on the graph and shows little difference between 

quantitated values derived from polyacrylamide gel electrophoresis compared torn the dotblot 
technique. These data show the feasibility of using the Ms-SNuPE dotblot procedure for high- 
throughput detection and quantitation of DNA methylation changes in cancer cells. 

35 Detailed Description of the Invention 

The present invention provides a method for determining DNA methylation patterns at 
cytosine sites, comprising the steps of: 
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(a) obtaining genomic DNA from a DNA sample to be assayed, wherein sources of 
DNA include, for example, cell lines, blood, sputum, stool, urine, cerebrospinal fluid, paraffin- 
embedded tissues, histological slides and combinations thereof; 

(b) reacting the genomic DNA with sodium bisulfite to convert unmethylated 

5 cytosine residues to uracil residues while leaving any 5-methylcytosine residues unchanged to 
provide primers specific for the bisulfite-converted genomic sample for top strand or bottom 
strand methylation analysis; 

(c) performing a PCR amplification procedure using the top strand or bottom strand 
specific primers; 

1 0 (d) isolating the PCR amplification products; 

(e) performing a primer extension reaction using Ms-SNuPE primers, [ 32 P]dNTPs 
and Taq polymerase, wherein the Ms-SNuPE primers comprise a from about a 1 5 mer to about 
a 22 mer length primer that terminates immediately 5' of a single nucleotide to be assayed; and 

(f) determining the relative amount of allelic expression of CpG methylated sites 
1 5 by measuring the incorporation of different 32 P-labeled dNTPs. 

Preferably, the [ 32 P]NTP for top strand analysis is [ 32 P]dCTP or [ 32 P]TTP. Preferably, 
the [ 32 P]NTP for bottom strand analysis is [ 32 P]dATP or [ 32 P]dGTP. Preferably, the isolation 
step of the PCR products uses an electrophoresis technique. Most preferably, the 
electrophoresis technique uses an agarose gel. 

20 DNA is isolated by standard techniques for isolating DNA from cellular, tissue or 

specimen samples. Such standard methods are found in textbook references such as Fritsch 
and Maniatis eds., Molecular Cloning: A Laboratory Manual, 1989. 

The bisulfite reaction is performed according to standard techniques. For example and 
briefly, approximately 1 microgram of genomic DNA (amount of DNA can be less when using 

25 micro-dissected DNA specimens) is denatured for 15 minutes at 45 °C with 2N NaOH 

followed by incubation with 0.1M hydroquinone and 3.6M sodium bisulfite (pH 5.0) at 55 °C 
for 12 hours (appropriate range is 4-12 hours). The DNA is then purified from the reaction 
mixture using standard (commercially-available) DNA miniprep columns, or other standard 
techniques for DNA purification are also appropriate. The purified DNA sample is 

30 resuspended in 55 microliters of water and 5 microliters of 3N NaOH is added for a 

desulfonation reaction, preferably performed at 40 °C for 5-10 minutes. The DNA sample is 
then ethanol-precipitated and washed before being resuspended in an appropriate volume of 
water. Bisulfite treatment of DNA distinguishes methylated from unmethylated cytosines. 
The present bisulfite treatment method has advantages because it is quantitative, does not use 

35 restriction enzymes, and many CpG sites can be analyzed in each primer extension reaction by 
using a multiplex primer strategy. 

The PCR amplification step (c) can be performed by standard PCR techniques, 
following a manufacturer's instructions. For example, approximately 1-2 microliters of the 
bisulfite-treated DNA. was used as a template for strand-specific PCR amplification in a region 
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of interest. In a PCR reaction profile for amplifying a portion of the pi 6 5* CpG island, for 
example, a procedure of initial denaturation of 94 °C for 3 minutes followed by a cycle of 94 
°C of 30 seconds, 68 °C for 30 seconds, 72 °C for 30 seconds for a total of 30 cycles. The 
PCR reactions were performed in 25 microliter volumes under conditions of: -50 ng bisulfite- 
converted DNA (less for micro dissected samples), 10 mM Tris-HCl (pH 8.3), 1.5 mM MgCl 2 , 
50 mM KC1, 0.1% gelatin/ml, 100 |iM of each of dNTP, 0.5 final concentration of each 
primer and 1 unit of Taq polymerase. There are many chromatographic techniques that can be 
used to isolate the PCR amplification products. In one illustrative procedure, approximately 
10-25 microliters of the amplified PCR products were loaded onto 2% agarose gels and 
electrophoresed. The bands were visualized and isolated using standard get purification 
procedures. 

The primer extension reaction is conducted using standard PCR primer extension 
techniques but using Ms-SNuPE primers as provided herein. Approximately 10-50 nanograms 
of purified PCR template is used in each Ms-SNuPE reaction. A typical reaction volume is 
about 25 microliters and comprises PCR template (about 10-50 ng), IX PCR buffer, 1 (aM of 
each Ms-SNuPE primer, 1 \xC\ of the appropriate 32 P-labeled dNTP (either [ 32 P]dCTP, 
[ 32 P]TTP, [ 32 P]dATP, [ 32 P]dGTP or combinations thereof), and 1 unit of Taq polymerase. As a 
general rule, oligonucleotides used in the primer extension reactions were designed to have 
annealing temperatures within 2-3 °C of each other and did not hybridize to sequences that 
originally contained CpG dinucleotides. The Ms-SNuPE reactions were performed at 95 °C 
for 1 minute, 50 °C for 2 minutes, and 72 °C for 1 minute. A stop solution (10 microliters) was 
added to the mixtures to terminate the reactions. The inventive Ms-SNuPE assay utilizes 
internal primer(s) which anneal to a PCR-generated template and terminate immediately 5' of 
the single nucleotide to be assayed. A similar procedure has been used successfully for 
detection of gene mutations Kuppuswamy et al., Proc. Natl. Acad. ScL USA 88:1 143-1 147, 
1991) and for quantitation of allele-specific expression (Szabo and Mann, Genes Dev. 9:3097- 
3108, 1995 and Greenwood and Burke, Genome Res. 6:336-348, 1996). 

There are several techniques that are able to determine the relative amount of 
methylation at each CpG site, for example, using a denaturing polyacrylamide gel to measure 
32 P through phosphorimage analysis, or transfer of Ms-SNuPE reaction products to nylon 
membranes, or even using fluorescent probes instead of a 32 P marker. In one method for 
determining the relative amount of methylation at each CpG site, approximately 1-2 microliters 
of each Ms-SNuPE reaction product was electrophoresed onto 15% denaturing polyacrylamide 
gel (7M urea). The gels were transferred to filter paper and then dried. Phosphorimage 
analysis was performed to determine the relative amount of radiolabeled incorporation. An 
alternative method for determining the relative amount of methylation at individual CpG sites 
is by a direct transfer of the Ms-SNuPE reaction products to nylon membranes. This technique 
can be used to quantitate an average percent methylation of multiple CpG sites without using 
polyacrylamide gel electrophoresis. High-throughput methylation analysis was performed by 
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direct transfer of the Ms-SNuPE reactions onto nylon membranes. A total of 100 microliters or 
.0.4 mM NaOH, 1 mM Na 4 P 2 0 7 was added to the completed primer extension reactions instead 
of adding stop solution. The mixture was directly transferred to nylon membranes using a 
dotblot vacuum manifold in a 96 well plate format. Each vacuum transfer well was washed a 
5 total of 4 times with 200 microliters of 2X SSC, 1 mM Na 4 P 2 0 7 . The entire membrane was 
washed in 2X SSC, 1 mM Na 4 P 2 0 7 . The radioactivity of each spot on the dried nylon 
membrane was quantitated by phosphorimaging analysis. 

In the inventive quantitative Ms-SNuPE assay, the relative amount of allelic expression 
is quantitated by measuring the incorporation of different 32 P-labeled dNTPs. Figure 1 outlines 
10 how the assay can be utilized for quantitative methylation analysis. For example, the initial 
treatment of genomic DNA with sodium bisulfite causes unmethylated cytosine to be 
converted to uracil, which is subsequently replicated as thymine during PCR. Methylcytosine 
is resistant to deamination and is replicated as cytosine during amplification. Quantitation of 
the ratio of methylated versus unmethylated cytosine (C versus T) at the original CpG sites can 
1 5 be determined by incubating a gel-isolated PCR product, primer(s) and Taq polymerase with 
either [ 32 P]dCTP or [ 32 P]TTP, followed by denaturing polyacrylamide gel electrophoresis and 
phosphorimage analysis. In addition, opposite strand (bottom strand) Ms-SNuPE primers are 
further designed which would incorporate either [ 32 P]dATP or [ 32 P]dGTP to assess methylation 
status depending on which CpG site is analyzed. 
20 Ms-SNuPE Primers 

The present invention further provides a Ms-SNuPE primer sequence designed to 
anneal to and terminate immediately 5' of a desired cytosine codon in the CpG target site and 
that is located 5 ! upstream from a CpG island and are frequently hypermethylated in promoter 
regions of somatic genes in malignant tissue. Preferably, the Ms-SNuPE primer sequence 
25 comprises a sequence of at least fifteen bases having a sequence selected from the group 
consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], HuNl 
[SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 8], 
HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID NO. 
11], CaS4 [SEQ ID NO. 12], and combinations thereof. The present invention further provides 
30 a method for obtaining a Ms-SNuPE primer sequence, comprising finding a hypermethylated 
CpG island in a somatic gene from a malignant tissue or cell culture, determining the sequence 
located immediately 5' upstream from the hypermethylated CpG island, and isolating a 15 to 
25 mer sequence 5' upstream from the hypermethylated CpG island for use as a Ms-SNuPE 
primer. The present invention further provides a Ms-SNuPE primer comprising a 15 to 25 mer 
35 oligonucleotide sequence obtained by the process comprising, (a) identifying hypermethylated 
CpG islands a somatic gene from a malignant tissue or cell culture source, (b) determining the 
sequence located immediately 5' upstream from the hypermethylated CpG island, and (c) 
isolating at least a 15 mer sequence 5* upstream from the hypermethylated CpG island for use 



9 



WO 98/56952 



PCT/US98/11896 



as a Ms-SNuPE primer. Preferably the Ms-SNuPE primer sequence is from about 15 to about 
25 base pairs in length. 

The ability to detect methylation changes associated with oncogenic transformation is 
of critical importance in understanding how DNA methylation may contribute to 
tumorigenesis. Regions of DNA that have tumor-specific methylation alterations can be 
accomplished using a variety of techniques. This will permit rapid methylation analysis of 
specific CpG sites using the inventive quantitative Ms-SNuPE primer process. For example, 
techniques such as restriction landmark genomic scanning (RLGS) (Hatada et ah, Proc. Natl 
Acad. Set USA 88:9523-9527, 1995), methylation-sensitive-representational difference 
analysis (MS-RDA) (Ushijima et al., Proc. Natl. Acad ScL USA 94:2284-2289, 1997) and 
methylation-sensitive arbitrarily primed PCR (AP-PCR) (Gonzalgo et al., Cancer Res. 57: 594- 
599, 1997) can be used for identifying and characterizing methylation differences between 
genomes. 

Briefly, sequence determinations of regions of DNA that show tumor-specific 
methylation changes can be performed using standard techniques, such as those procedures 
described in textbook references such as Fritsch and Maniatis eds., Molecular Cloning: A 
Laboratory Manual, 1989. Additionally, commercially available kits or automated DNA 
sequencing systems can be utilized. Once specific regions of DNA have been identified by 
using such techniques, the Ms-SNuPE primers can be applied for rapidly screening the most 
important CpG sites that are involved with the specific methylation changes associated with a 
cancer phenotype. 

Example 1 

This example illustrates a quantitative methylation analysis of three top strand sites in a 
5' CpG island of pi 6 in various DNA samples using the inventive method. The top panel 
provides the locations of three sites analyzed (numbered 1, 2 and 3) relative to the putative 
transcriptional start sites (vertical arrows pointing upwards) and the exon la coding domain. 
The PCR primers used for top strand amplification of the 5' region of pi 6 (which includes 
putative transcriptional start sites) were 5'-GTA GGT GGG GAG GAG TTT AGT T-3' [SEQ 
ID NO. 13] and 5'-TCT AAT AAC CAACCA ACC CCT CC-3* [SEQ ID NO. 14]. The 
reactions were performed in 25 |ii total volume under the conditions of 50 ng bisulfite-treated 
DNA, 10 mM Tris-HCl (pH 8.3), 1.5 mM MgCl 2 , 50 mM KC1, 0.1% gelatin/ml, 100 jiM of 
each dNTP, 0.5 \iM final concentration of each primer and 1U of Taq polymerase (Boehinger 
Mannheim, Indianapolis, IN). The reactions were hot-started using a 1:1 mixture of 
TaqlTaqStart antibody (Clontech, Palo Alto, CA). 

An initial denaturation of 94 °C for 3 minutes was followed by 94 °C for 30 sec, 68 °C 
for 30 sec, 72 °C for 30 sec for a total of 35 cycles. The PCR products were separated by 
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electrophoresis on 2% agarose gels and the bands were isolated using a Qiaquick™ gel 
extraction kit (Qiagen, Santa Clarita, CA). 

The Ms-SNuPE reaction was performed in a 25 ml reaction volume with 10-50 ng of PCR 
template incubated in a final concentration of 1 x PCR buffer, 1 |iM of each Ms-SNuPE 

5 primer, 1 ^iCi of either [ 32 P]dCTP or [ 32 P]TTP and 1U of Taq polymerase. The primer 
extensions were also hot-started using a 1 : mixture of TaqlTaqStart antibody. The primers 
used for the Ms-SNuPE analysis were: site 1 5'-TTT TTT TGT TTG GAA AGA TAT-3' [SEQ 
ID NO. 15]; site 2 5'-TTT TAG GGG TGT TAT ATT-3' [SEQ ID NO. 16]; site 3 5'-TTT GAG 
GGA TAG GGT-3' [SEQ ID NO. 17]. The conditions for the primer extension reactions were 

10 95 °C for 1 minute, 50 °C for 2 minutes and 70 °C for 1 minute. A stop solution (10 \x\) was 
added to the reaction mixtures and the samples were loaded onto 1 5% denaturing 
polyacrylamide gels (7 M urea). Radioactivity of the bands was quantitated by 
phosphorimaging analysis. The control sets included "M" PCR product amplified from a 
plasmid containing bisulfide-specific methylated sequence; "U" PCR product amplified from a 

15 plasmid containing bisulfite-specific unmethylated sequence; and "mix" a 50:50 mixture of 
methylated and unmethylated PCR-amplified plasmid sequences. The DNA samples analyzed 
included T24 and J82 bladder cancer cell lines; wbc (white blood cell), melanoma (primary 
melanoma tumor tissue sample), and bladder (primary bladder tumor tissue sample). The 
tissue samples were micro dissected from paraffin-embedded tumor material. The grid at the 

20 bottom of the lower panel shows the ratio of methylated (C) versus unmethylated (T) bands at 
each site based upon phosphorimage quantitation. 

These data (Figure 2) show the ability of the inventive assay to detect altered patterns 
of methylation. 

25 Example 2 

This example illustrates a mixing experiment showing a linear response of the inventive 
Ms-SNuPE assay for detection of cytosine methylation. A T24 bladder cancer cell line DNA 
(predominantly methylated) was added in increasing amounts to a J82 bladder cancer cell line 
DNA (predominantly unmethylated). Figure 3 shows data from an 18 mer oligonucleotide 

30 [SEQ ID NO. 1 6] which was used in multiplex analysis of CpG methylation (site 2) of the pi 6 
5'CpG in combination with a 15-mer and 21 -mer primer [SEQ ID NOS 17 and 15, 
respectively] (correlation coefficient =0.99). Both the 15 mer and 21 -mer produced a nearly 
identical linear response as the 18-mer. Figure 3 shows data from three separate experiments. 
Differential specific activity and incorporation efficiency of each [ 32 P]dNTP was controlled for 

35 by using a 50:50 mixture of bisulfite-specific methylated versus unmethylated PCR template 
for analysis. 

Example 3 

This example provides a summary of DNA regions for which Ms-SNuPE primers can 
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be designed and the inventive method applied for a quantitative detection of abnormal DNA 
methylation in cancer cells. The sequences are listed according to name, size and frequency of 
hypermethylation in the corresponding cell line or primary tumor. 



fragment 


size (bp) 


methylated in 
colon cell line 


hypermethylated 
in colon cancer 


lypermethylated 
in bladder cancer 


comments 


GaLl 


530 


7/7 (100%) 


3/7 (42%) 


3/7 (42%) 


GC content (0.6), 
observed/expected 
CpG (0.63) 


GaL2 


308 


7/7(100%) 


4/5 (80%) 


6/7 (85%) 


GC content (0.6), 
observed/expected 
CpG (0.6) 


GaL4 


177 


7/7 (100%) 


1/2 (50%) 


Va (75%) 


GC content (0.59), 
observed/expected 
CpG (0.50) 


CaSl 


215 


4/7 (57%) 


0/5 (0%) 


2/7 (28%) 


GC content (0.55), 
observed/expected 
CpG (0.78) 


CaS2 


220 


4/7 (57%) 


3/5 (60%) 


3/7 (42%) 


GC content (0.54), ! 
observed/expected 
CpG (0.74) 


CaS4 


196 


6/7 (85%) 


0/5 (0%) 


1/7(14%) 


GC content (0.64), 
observed/expected 
CpG (0.84) 


HuNl 


148 


7/7(100%) 


3/5 (60%) 


3/7 (42%) 


GC content (0.54), 
observed/expected 
CpG (0.99) 


HuN2 


384 


7/7(100%) 


4/5 (80%) 


2/7 (28%) 


GC content (0.6), 
observed/expected 
CpG (0.62) 


HuN3 


178 


6/7 (85%) 


4/5 (80%) 


3/7 (42%) 


GC content (0.53), 
observed/expected 
CpG (0.97) 


HuN4 


359 


7/7(100%) 


3/5 (60%) 


4/7 (57%) 


GC content (0.51), 
observed/expected 
CpG (0.47) 
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HuN5 


251 


7/7(100%) 


2/5 (40%) 


5/7 (71%) 


GC content (0.63), 
observed/expected 
CpG (0.77) 


HuN6 


145 


6/7 (85%) 


3/4 (75%) 


1/2 (50%) 


GC content (0.55), 
observed/expected 
CpG (0.47) 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (i) APPLICANTS: Mark L. Gonzalgo and Peter A. Jones 

(ii) TITLE OF INVENTION: A CANCER DIAGNOSTIC METHOD BASED 
UPON DNA METHYLATION DIFFERENCES 

,10 (iii) NUMBER OF SEQUENCES: 17 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIS WRIGHT TREMAINE 

(B) STREET: 26 00 Century Square, 1501 Fourth Avenue 
15 (C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: U.S.A. 

(F) ZIP: 98101 

20 (v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) • COMPUTER: PC compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: Word 

25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: to be assigned 

(B) FILING DATE: 

(C) CLASSIFICATION: 

30 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Oster, Jeffrey B. 

(B) REGISTRATION NUMBER: 32,585 

(C) REFERENCE/DOCKET NUMBER: 47675-2 

35 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 2 06 628 7711 

(B) TELEFAX: 206 628 7699 

40 (2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 0 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: GaLl 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



1 CCCGCGACCT AAGCCAGCGA CTTACCACGT TAGTCAGCTA AGAAGTGGCA 5 0 

51 GAGCTGGGAT TCGAACCTAT AAAGAACTCT GAAGCCTGGG TATTTTTACA 10 0 

55 101 TGACACTTTA CATAATGCGC CACGGGGTAG TCGGAGGGGG AGGTCCATCT 150 

151 CCCTTTCCCT TGCTGTCCAT CTCCACAGAA AAGAAGCAAG TGGAGGACAG 200 

2 01 GAGCCAGAAA GTCATCTGGC CGCGGATCAT TCCGGAGTGA CCCCCGCCGC 250 

251 CACCACTCGC ATAGTCCGCT TATGGCGGGA GGGCACCTCA GAGATTCTCA 3 00 
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3 01 CAGGGGCTGT GCGGCCAGAA CCAGAAGTGC AAAGCACCGT TAGCGACTCT 3 50 

3 51 ATCGCCCCCT GCCGCCTGTG GCGCCCAGTC CGAAGCTGCT GTTTTCAGGA 4 00 

4 01 GGGCTAGTGG GCTAAGAAAA GAGCTCACCG ACTGACTGCC CAACAGCTGT 4 50 
4 51 TGCGAGCCAG TGCTAGGCTG CAGACAGCCT TGCCAAATGT GGTGACATAA 500 

5 5 01 GCGGGAGGGG GGAACATTTA GAGAGCCCTA 53 0 



(2) INFORMATION FOR SEQ ID NO : 2 : 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 08 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



15 



(ii) MOLECULE TYPE: GaL2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

20 1 CTAGGGTAGG CTGGTCTGTG CTGGATACGC GTGTTCTTCT GCGGAGTTAA 50 

51 AGGGTCGGGG ACGGGGGTTC TGGACTTACC AGAGCAATTC CAGCCGGTGG 100 
101 GCGTTTGACA GCCACTTAAG GAGGTAGGGA AAGCGAGCTT CACCGGGCGG 150 
151 GCTACGATGA GTAGCATGAC GGGCAGCAGC AGCAGCAGCC AGCAAAAGCC 200 
201 TAGCAAAGTG TCCAGCTGCT GCACTGCCGC GGGGACTCCC ACATCACCAT 250 

25 251 GACTAGTTGT GCAACTCTGC AGCAGAAACG GCTTCCGAGG AACACAGGAT 3 00 
3 01 CGCGGGGG 3 08 



(2) INFORMATION FOR SEQ ID NO: 3: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: GaL4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



40 



1 GCTTCCTTTT TCTCGGCTTT CCTCACTATC CTCTCCCTGT TCGAGAGTAT 50 

51 CTCCACCAGC ACCGAGCCTC ACACGGGCTG TGCCTCCATC TTTGGAATGC 100 

101 CTACCCTTCT TTCTTGCGAA GCCCCTCCCA GGGCCAGCCC TTGTGCACCG 150 
45 151 GCTCAAGGGG ACTGCTCTCC TGCCTCG 177 



(2) INFORMATION FOR SEQ ID NO: 4: 



50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 

(B) TYPE: oligonucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



55 



(ii) MOLECULE TYPE: HuNl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 
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1 TTGCGCCGAT CGTCAAGAAC CTCTCATCCC TGGCAGCAGC AAAGCCAATA 50 

51 TATTTCCATT TCTTATTTCA GTTTGCCACC AAAACAAAGC TGCGCGCGGC 100 

101 TGAGGGCAGG AAGGCGCTGA GACCGACCGA GAAGAAGGGA CGTCCCGG 14 8 

5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 384 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



15 



45 



55 



(ii) MOLECULE TYPE: HuN2 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 



1 CAGGCCCGCC GAGACTCCAC TCCAACTACC AGGAAATTTC CCGTGGAGCT 50 
51 TCAATTCCTG GGACCCTCCT ACTGCGGGGA GAGTGGTTTC CCTGCCCCAC 100 
20 101 ACCATGCCCT AGGCCCGAGT CTGCGGCTCT TGGGGGATCT CTCCGAGCTC 15 0 
151 CGACACCGTG TTCGGACCGG GTGCGCCCTG CCGCTGGGGC TCAAGCCTGC 2 00 
201 AGGCGTGAGA ACCGGGGGAC TCTCTATGGC ACCAAGAGCT TCACCGTGAG 250 

2 51 CGTAGGCAGA AGCTTCGCTT TGATCCTAGG GCTTACAAAG TCCTCCTTTG 3 00 

3 01 GCTGCCCATG ATGGTAAAAG GGCAGTTGCT CACAAAGCGC GAGTGTGTGT 3 50 
25 3 51 GCCAGACAGT GTAAATGAGT GTTGGGACCG GCGT 3 84 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

35 (ii) MOLECULE TYPE: HuN3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

1 GGGTCCGTTC GTGAATGCAT GAGCAGGGTG TGAGCGCCAG GGGGTTACAC 50 
40 51 TTCTCACGGG TTAAAACCCA GACAACTTCA CGAGGGAACC ACGTGCCATT 100 
101 TTAACAGCGT ACGGTCGGGA TCGTGGGACG TCATTAAACG GAGTGGGTTG 150 
151 AGTATGTGAC TCTGTCACCC ATTTTCTG 



(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: HuN4 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

1 CCCCGCGGGG CAGAATCCAA GTGAGTCAGA CACATTGCTC CCTCCCTGCT 50 
51 GCTGCCAGTC CATCTCTTTG CCAACAAACC TGCTTAAAAT GCCAAAGCTG 100 
101 GTCCAAAGTT TCAGGAAAAC AACTTCCGCC AGAGGGCACG TAGAGGGCAC 150 
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151 AGATGCTATA GATGCTTCTC TGACAAACAC TCCTGACCCC CTTGACAGAT 200 
201 TGGAAAATAC ATGGTTCAGA AAGGGTGAGA GATTTCAACT TGAGAAGTGA 250 
251 AACTAGGAAA AGATGGAAGG TGTCCGGATT TCTAGCTCAA GTCCACACAC 300 
301 TGCTTCTGCT GCGGTGACTA AATCGTGGCT GTGTTCTCAT CACCTGCCTC 350 
5 351 GCGGCGCGC 359 



(2) INFORMATION FOR SEQ ID NO: 8: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



15 



40 



50 



(ii) MOLECULE TYPE: HuN5 primer 



1 GGCGGGCCTG GGCACCGCGG AGGGGGGGCT TTTCTGCGCC CGGCGAAGCG 50 
51 TGGAACTTGC GCCCTGAGGC AGCGCGGCGA GACCAGTCCA GAGACCGGGG 100 
20 101 CGAGCCTCCT CAGGATTCCT CGCCCCAGTG CAGATGCTGT GAGCTTAGAC 150 
151 GAGGACAGGG CATGGCACTC GGCTTGGCCC GTAGTGGACG GTGTTTTTGC 200 
201 AGTCATGAAC CCAAACGCCG CAAACCTTGA CCGTTTCCCC ACCCGTGTTG T 
251 

25 (2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: HuN6 primer 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

1 TGAGAGCAGC ATCCTCCCCT GCGTGTGGTT CTCTAACTTA CCTCCTGTAT 50 
51 GGGGTCTGCG GACCCAGCAC ACCTCCCGGG CCCCCAAAAA ATTCCAGCTC 10 0 
101 AAGAGCCCTA AAAATCCTTA CCCTGNNAAA GTTTGAGCTT CTCCC 14 5 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 215 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: CaSl primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



1 ACGCCGGCCA CAGTTCTTCA GTGAAACGCT TCACTCTCTG GTCATAGAGG 50 
51 TAGGAAACTA TAGCTGTCCC AACTAAATGT CAGGACGAAT TAGCCCAGCT 100 
55 101 GGTCACGCTC ACAGTCACCG CCTCCACCAG ACTGAGCGAC CCTCCCAACG 150 
151 GGGTTTGCCG TGTTGGGAGG ACAGCGGAGT TTCGTTGCTG TGTCAATTTG 200 
201 TGTAGACGCG GCTGC 215 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CaS2 primer 



1 CTGCTCTCTT CTCTTCTTTT CCCCTTTCCT CTCCTCTCCC TTTCCTCAGG 50 
51 TCACAGCGGA GTGAATCAGC TCGGTGGTGT CTTTGTCAAC GGGCGGCCAC 100 
101 TGCCGGACTC CACCCGGCAG AAGATTGTAG AGCTAGCTCA CAGCGGGGCC 150 
151 CGGCCGTGCG ACATTTCCCG AATTCTGCAG GTGATCCTCC CGGCGCCGCC 2 00 
15 201 CCACTCGCCG CCCCCGCGGC 220 



(2) INFORMATION FOR SEQ ID NO: 12: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

25 

(ii) MOLECULE TYPE: CaS4 primer* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

30 1 GGGCGGCACG GAGGGAGTCA GGAGTGAGCC CGAAGATGGA GAGAAGTCGA 50 

51 TTCGCCCAGA GAACGCAAGA CGGTGGATCA GAGATGAGTC CCAGGAACCT 100 

101 CAGAGAGCGA GGCTGACAGG CCCGGGGAGA GGACCGGGCA GGGACAAACC 150 

151 AGCGGACAGA GCAGAGCGCG AAATGGTTGA GACCGGGAAG CGACCT 196 

35 (2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 



45 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
1 5'-GTA GGT GGG GAG GAG TTT AGT T-3 ' 22 

(2) INFORMATION FOR SEQ ID NO : 14 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
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10 



(ii) MOLECULE TYPE: Ms-SNuPE primer from plS promoter 
region 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
1 5'-TCT AAT AAC CAA CCA ACC CCT CC-3 

(2) INFORMATION FOR SEQ ID NO:15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 ' (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

1 5* -TTT TTT TGT TTG GAA AGA TAT- 3' 21 

25 (2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from plS promoter 
region 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
1 5' -TTT TAG GGG TGT TAT ATT- 3 1 18 
40 (2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: Ms-SNuPE primer from pl6 promoter 
region 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
1 5' -TTT GAG GGA TAG GGT-3 1 15 

55 
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We claim: 

1 . A method for determining DNA methylation patterns at cytosine sites, 
comprising the steps of: 

(a) obtaining genomic DNA from a DNA sample to be assayed; 

(b) reacting the genomic DNA with sodium bisulfite to convert unmethylated 
cytosine residues to uracil residues while leaving any 5-methylcytosine residues unchanged to 
provide primers specific for the bisulfite-converted genomic sample for top strand or bottom 
strand methylation analysis; 

(c) performing a PCR amplification procedure using the top strand or bottom strand 
specific primers; 

(d) isolating the PCR amplification products; 

(e) performing a primer extension reaction using Ms-SNuPE primers, [ 32 P]dNTPs 
and Taq polymerase, wherein the Ms-SNuPE primers comprise a from about a 15 mer to about 
a 22 mer length primer that terminates immediately 5' of a single nucleotide to be assayed; and 

(f) determining the relative amount of allelic expression of CpG methylated sites 
by measuring the incorporation of different 32 P-labeled dNTPs. 

2. The method of claim 1 wherein the [ 32 P]dNTP for top strand analysis is 
[ 32 P]dCTP or [ 32 P]TTP. 

3. The method of claim 1 wherein the [ 32 P]dNTP for bottom strand analysis is 
[ 32 P]dATP or [ 32 P]dGTP. 

4. The method of claim 1 wherein the isolation step of the PCR products uses an 
electrophoresis technique. 

5. The method of claim 4 wherein the electrophoresis technique uses an agarose 

gel. 

6. The method of claim 1 wherein the Ms-SNuPE primer sequence comprises a 
sequence of at least fifteen but no more than twenty five bases having a sequence selected from 
the group consisting of GaLl [SEQ ID NO. 1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], 
HuNl [SEQ ID NO. 5], HuN2 [SEQ ID NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 
8], HuN5 [SEQ ID NO. 8], HuN6 [SEQ ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID 
NO. 1 1], CaS4 [SEQ ID NO. 12], and combinations thereof. 

7. A Ms-SNuPE primer sequence designed to anneal to and terminate immediately 
5' of a desired cytosine codon in a CpG target site comprising an oligonucleotide sequence of 
at least 15 base pairs and corresponding to a gene sequence located immediately 5' upstream 
from the CpG island that is frequently hypermethylated in promoter regions of somatic genes 
in malignant tissue. 

8. The Ms-SNuPE primer sequence wherein the primer sequence is from about 1 5 
to about 25 base pairs in length and selected from the group consisting of GaLl [SEQ ID NO. 
1], GaL2 [SEQ ID NO. 2], GaL4 [SEQ ID NO. 3], HuNl [SEQ ID NO. 5], HuN2 [SEQ ID 
NO. 6], HuN3 [SEQ ID NO. 7], HuN4 [SEQ ID NO. 8], HuN5 [SEQ ID NO. 8], HuN6 [SEQ 
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ID NO. 9], CaSl [SEQ ID NO. 10], CaS2 [SEQ ID NO. 11], CaS4 [SEQ ID NO. 12], and 
combinations thereof. 

9. A method for obtaining a Ms-SNuPE primer sequence designed to anneal to and 
terminate immediately 5' of a desired cytosine codon in the CpG target site, comprising finding 
a hypermethylated CpG island in a somatic gene from a malignant tissue or cell culture, 
determining the sequence located immediately 5' upstream from the hypermethylated CpG 
island, and isolating a 15 to 25 mer sequence 5' upstream from the hypermethylated CpG island 
for use as a Ms-SNuPE primer. 

10. A Ms-SNuPE primer comprising a 1 5 to 25 mer oligonucleotide sequence 
obtained by the process comprising, finding a hypermethylated CpG island in a somatic gene 
from a malignant tissue or cell culture, determining the sequence located immediately 5 f 
upstream from the hypermethylated CpG island, and isolating a 15 to 25 mer sequence 5' 
upstream from the hypermethylated CpG island for use as a Ms-SNuPE primer. 
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